LLM 0.24 introduces fragments and template plugins to better utilize long context models, improving storage efficiency and enabling new features like querying logs by fragment and leveraging documentation. It also details improvements to template handling and model support.
A review of the Qwen2.5-VL-32B large language model, noting its performance, capabilities, and how it runs on a 64GB Mac. Includes a demonstration with a map image and performance statistics.
Simon Willison discusses his experience using Large Language Models (LLMs) for coding, providing detailed advice on how to effectively use LLMs to augment coding abilities, set reasonable expectations, manage context, and more.
A guide on using large language models (LLMs) for programming tasks, including examples, strategies, and useful tips for effectively using AI assistants like ChatGPT and Claude.
Simon Willison discusses the release of llm-anthropic 0.14, which adds support for Claude 3.7 Sonnet's new features. Key features include extended thinking mode, a massive increase in output limits, and improved support for long tasks. The article also covers the plugin's implementation details and limitations.
OpenAI's documentation for their o1 and o3 'reasoning models' includes tips on how to best prompt them, such as using developer messages, delimiters, and specific instructions.
Alibaba's Qwen 2.5 LLM now supports input token limits up to 1 million using Dual Chunk Attention. Two models are released on Hugging Face, requiring significant VRAM for full capacity. Challenges in deployment with quantized GGUF versions and system resource constraints are discussed.
Simon Willison shares his predictions regarding the development of AI and LLMs over the next 1, 3, and 6 years. He discusses the potential failure of AI agents to fully realize their expected capabilities, the success of coding and research assistants, a Pulitzer prize for AI-assisted investigative reporting within three years, the emergence of privacy laws, the creation of amazing art in six years, and concerns about AGI/ASI leading to mass civil unrest.
A review of advancements and key themes in Large Language Models over the course of 2024, including GPT-4 barrier breaking, reduced costs, multimodal capabilities, and more.
Concatenate a directory full of files into a single prompt for use with LLMs